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TN THE CLAIMS : 

Please substitute the following claims lor Ihc same numbered claims in the application. 

Claim 1 (Previously Presented): A method for evaluating and outpulling a final clustering 
solution for a plurality of multi-dimensional data records, said data records having multiple, 
heterogeneous feature spaces represented by feature vectors, said method comprising: 

defining a distortion between two feature vectors as a weighted sum of distortion 
measures on components of said feature vectors; 

clustering said multi-dimensional data records into k-clusters using a convex 
programming formulation; 

selecting feature weights of said feature vectors, and 

minimizing distortion of said k-clusters. 

Claim 2 (Previously Presented): The method according to claim l f wherein said selecting of 
feature weights is optimized by an objective function to produce said final clustering solution 
that simultaneously minimizes average intra-cluster dispersion and maximizes average inter- 
cluster dispersion along all said heterogeneous feature spaces. 

Claim 3 (Original): The method according to claim 1, wherein said clustering includes initially 
applying a local minima of said clustering. 

Claim 4 (Original): The method of claim 1 , wherein said clustering comprises a k-means 
clustering algorithm. 
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Claim 5 (Previously Presented); The method of claim 2, wherein said minimizing distortion of 
individual clusters includes taking said data records and iterativcly determining Vorunui 
partitions until said objective function, between two successive iterations, is less than a specified 
threshold. 

Claim 6 (Original): The method of claim 1, wherein said clustering comprises analyzing word 
data, and said feature vectors comprise multiple- word frequencies of said data records. 

Claim 7 (Previously Presented): The method of claim U wherein said clustering comprises 
analyzing data records having numerical and categorical attributes, and said feature vectors 
comprise linearly-scaled numerical attributes. 

Claim 8 (Previously Presented): A method for evaluating and outputting a clustering solution for 
a plurality of multi-dimensional dala records, said data records having multiple, heterogeneous 
feature spaces represented by feature vectors, said method comprising: 

defining a distortion between two said feature vectors as a weighted sum of distortion 
measures on components of said feature vectors; 

clustering said multi-dimensional dala records into k-clusters using a convex 
programming formulation of a generalized k-means clustering funct ion; and 

selecting optimal feature weights of said feature vectors by an objective function to 
produce said solution of a final clustering that simultaneously minimizes average intra-cluster 
dispersion and maximizes average inter-cluster dispersion along all said feature spaces. 
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Claim 9 (Original): The method of claim 8, wherein said clustering includes initially applying a 
local minima of said clustering. 

Claim 1 0 (Previously Presented): The method of claim 8, wherein said minimizing distortion of 
individual clusters includes taking said data records and itcratively determining Vortmol 
partitions until said objective function, between two successive iterations, is less than a specified 
threshold. 

Claim 1 1 (Original): The method of claim 8, wherein said clustering comprises analyzing word 
data, and said feature vectors comprise multiple-word frequencies of said data records. 

Claim 12 (Previously Presented): The method of claim 8, wherein said clustering comprises 
analyzing data records having numerical and categorical attributes, and said feature vectors 
comprise linearly-scaled numerical attributes. 

Claim 13 (Previously Presented): A computer system for data mining and outpulling a final 
clustering solution, wherein said system includes a memory for storing a database having a 
plurality of multi-dime nsionai data records, each having multiple, heterogeneous feature spaces 
represented by feature vectors, said system including a processor for executing instructions 
comprising: 

defining a distortion between two feature vectors as a weighted sum of distortion 
measures on components of said feature vectors; 

clustering said multi -dimensional data records into k-elustcrs using a convex 
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programming formulation; and 

selecting feature weights of said feature vectors, 

wherein said infitruction for selecting said feature weights arc optimized by implementing 
an objective function to produce said solution of a final clustering that 

Claim 14 (Cancelled), 

Claim 15 (Original): Ihe system of claim 13, wherein said instruction of said clustering includes 
an instruction for initially applying a local minima or said clustering. 

Claim 1 6 (Original): The system of claim 13, wherein said instruction for clustering includes 
instructions for implementing a k-means clustering algorithm. 

Claim 17 (Previously Presented): This system of claim 13, further comprising an instruction for 
minimizing distortion of individual clusters including taking said data records and iteratively 
determining Voronoi partitions until said objective function, between two successive iterations, 
is less than a specified threshold. 

Claim 18 (Original): The system of claim 13, wherein said instruction for clustering includes 
instructions for analyzing word data. 

Claim 19 (Original): The system of claim 13, wherein said instruction for clustering includes 
instructions for analyzing data records having numerical and categorical attributes, 
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Claim 20 (Currently Amended): A program storage device readable by machine, tangibly 
embodying a program of instructions executable by said machine to perform a method for 
evaluating and outputting a final clustering solution from a set of data records having multiple, 
heterogeneous feature spaces represented as feature vectors, said method comprising: 

defining a distortion between two feature vectors as a weighted sum of distortion 
measures on components of said feature vectors; 

clustering said multi-dimensional data records into k-clusters using a convex 
programming formulation; and 

selecting feature weights of said feature vectors, 

wherein said selecting of feature weights are optimized bv an objective function to 
produce said solution of a final clustering that simultaneously minimizes average intra-clustcr 
dispersion and maximizes average inter-cluster dispersion along all said feature spaces. 

Claim 21 (Cancelled). 

Claim 22 (Original): The device of claim 20, wherein said clustering includes initially applying 
a local minima of said clustering. 

Claim 23 (Original): The device of claim 20, wherein said clustering comprises a k-means 
clustering algorithm. 

Claim 24 (Previously Presented): The device of claim 20, wherein said minimizing distortion of 
individual clusters includes taking said data records and itcrativcly determining Voronoi 
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partitions until said objective function, between two successive iterations, is less than a specified 
threshold. 

Claim 25 (Original): 1 he device of claim 20 T wherein said clustering comprises analyzing word 
data, and said feature vectors comprise multiple-word frequencies of said data records. 

Claim 26 (Previously Presented): The device of claim 20, wherein said clustering comprises 
analyzing data records having numerical and categorical attributes, and said feature vectors 
comprj$e linearly-scaled numerical attributes. 



61 



ARC9-2000-0078-US 1 



PAGE 61/63 1 RCVD AT 12130/2004 7:09:58 PM [Eastern Standard Time] * SVR:USPT0«EFXRF-1/1 ' DNIS:8729306 * CS!D:301 261 8825 * DURATION (mm-ss):15-38 



